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LEGACY MIL-STD-1750A SOFTWARE 
EMULATOR ADDRESS TRANSLATION 
USING POWER PC MEMORY 
MANAGEMENT HARDWARE 

CROSS REFERENCE TO RELATED 
APPLICATIONS 

This application is a continuation-in-part of commonly 
owned application Ser. No. 09/002,960, now U.S. Pat. No. 
6,041,402, filed on Jan. 5, 1998. This application is also 
related to commonly owned copending patent application, 
filed on even date, entitled; "Concurrent Legacy and Native 
Code Execution Techniques"; by T. Hoerig, et al., applica- 
tion Ser. No. 09/451,156. 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates to a memory management 
system and more particularly to memory management sys- 
tem which can be used with a software emulator which 
meets MIL-STD-1750A with respect to paging and protec- 
tion attributes, such as block protection and access lock and 
key functions. 

2. Description of the Prior Art 

It is known that microprocessors are configured with 
different instruction set architectures (ISA). The ISA deter- 
mines the instruction set for a particular microprocessor. 
Application programs are executed by the microprocessors 
normally written in relatively high level language, which is 
compiled into machine instructions compatible with the 
instruction set for the specific microprocessor. Microproces- 
sors are increasingly being designed to execute instructions 
faster and faster. As such, systems incorporating such micro- 
processors are often upgraded to increase the speed of the 
system. Unfortunately, depending on the particular upgrade, 
often times the instruction set of the upgrade microprocessor 
is incompatible with the instruction set of the microproces- 
sor to be replaced ("legacy microprocessor"). As such, in 
such applications, the existing application programs often 
need to be rewritten in new and modem computer languages 
with modem compilers. Unfortunately, such an undertaking 
can be quite cumbersome and expensive. 

Due to the age and obsolescence of many existing avionic 
onboard computers, the reliability of such computers is 
rapidly declining while maintenance is becoming more 
difficult and costly to achieve. As such, it is sometimes 
required to replace outdated "legacy" microprocessors with 
newer technology "native" microprocessors. To work 
around instructions set incompatibilities, emulation systems 
(emulators) have been developed. Emulators are known 
which emulate the instructions set of the legacy micropro- 
cessor in order to enable the instructions of the legacy 
microprocessor to be "executed" by a different micropro- 
cessor. Both software and hardware based emulators are 
known. For example, various software emulators for the 
F-16 avionics integration support facility (AISF) common 
modular environment (COMET) are described in document 
no. F-16AISF-COMET-100 (EMULATORS -SWD -A, dated 
May 21, 1996). Hardware based emulators for military 
standard MIL-STD-1750A, are discussed in the document 
entitled Line Replaceable Unit Emulator Hardware Product 
Fabrication Specification, document no. SFF20702 dated 
Apr. 16, 1996. 

Unfortunately, known software emulators have been 
known to be relatively inefficient. In particular, in such 
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known software emulators, legacy instructions are fetched 
or the upgrade microprocessor which uses a look up table to 
interpret the legacy instruction. Since each legacy instruc- 
tion must be interpreted, computer systems which incorpo- 

5 rate cache memory are known to suffer from relatively high 
probability of cache misses which decreases the overall 
throughput of the system. 

Another problem with such software emulators is the need 
to comply with MIL-STD-1750A with respect to memory 

10 management. As used herein MIL-STD-1750A refers to 
"Military Standard Sixteen Bit Computer Instruction Set 
Architecture", dated Jul. 2, 1980, hereby incorporated by 
reference. MIL-STD-1750A sets forth a virtual paging and 
protection scheme that maps sixteen (16) 64K word logical 

15 operand spaces and sixteen (16) 64K word logical instruc- 
tion spaces into 1 M word of physical memory. Each 64K 
word logical space is controlled by sixteen (16) logical page 
registers which control the logical to physical mapping as 
well as the protection attributes, such as execute protect and 

20 write protect, as well as an access lock and key functions. 
Block protection is specified for physical memory in blocks 
of IK word size. Block protection over-rides page register 
protection. In other words, if the block is write protected, it 
is write protected regardless of the write protection setting in 

25 the page registers. 

To accurately model the protection attributes in a software 
emulator, each operand access and each instruction access 
require software to look up the current settings for the page 
register and determine the protection. If the attribute enables 

30 access, then the access is performed, otherwise a fault is 
generated. Such protection attributes are known to seriously 
degrade the performance of such software emulators. Thus, 
there is a need to efficiently implement paging and block 
protect, and lock and key functions which meet MIL-STD- 

35 1750A for a software emulator. 

SUMMARY OF THE INVENTION 

Briefly the present invention relates to a system and 
method for implementing the paging and protection 

40 attributes, such as block protection and access lock and key 
functions promulgated in MIL-STD-1750A. The present 
invention takes advantage of the PowerPC microprocessor 
architecture to implement the paging and protection 
attributes required by MIL-STD-1750A in hardware. Since 

45 the paging and the protection attributes are implemented in 
hardware, the system performance is greatly improved. 

BRIEF DESCRIPTION OF THE DRAWING 

These and other objects of the present invention will be 
50 readily understood with reference to the following specifi- 
cation and attached drawing, wherein: 

FIG. 1 is a block diagram illustrating the translation of the 
legacy instruction in accordance with the present invention. 
55 FIG. 2 is a block diagram illustrating one embodiment (in 
which vector translation is done by hardware) for translating 
the legacy instructions in accordance with the present inven- 
tion. 

FIG. 3 is a block diagram illustrating the mapping of the 
60 legacy instructions to main memory. 

FIG. 4 is a block diagram of an upgrade microprocessor 
with cache memory for illustrating the operation of the 
cache system in accordance with the present invention. 
FIG. 5 is a diagram of the memory mapping in accordance 
65 with the present invention. 

FIG. 6 is a diagram illustrating the virtual paging archi- 
tecture of a PowerPC microprocessor. 
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FIG. 7 is an exemplary block diagram of the system in 2 and INSTR 3, etc. The ROM 20 also includes immediate 

accordance with the present invention. data. The instructions INSTR 1, INSTR 2, INSTR 3, etc. 

FIG. 8 is a bit map of the word format used in logical page P^s the immediate data in the ROM 20, are located, for 

registers in accordance with MIL-STD-1750A. example, at a base address A within the memory space. 

FIG. 9 is a bit map of a status word register in accordance 5 In accordance with an important aspect of the invention, 

with MIL-STD-1750A. eacn instruction (i.e. instruction INSTR 1, INSTR 2, INSTR 

ttt^ m • ui i a- ™ t *u du.^k^™ ncn* 3, etc.) is translated to a direct vector to a software routine 

FIG. 10 is a block diagram of the PhysMem 1750A ' > . , _ , . . , . t „ 

c t . u1 , .„ . * j - F7T/- n or emulation code for emulating the legacy instruction, tor 

function block illustrated in FIG. 7. , , . . t . & . 6 ' t . t , 4 

c i_i i example, each legacy instruction is mapped or translated to 

FIG. 11 is a filter for the PageReg 1750A function block 10 another memory device 22 such ^ a R0M 22 The structure 

illustrated in FIG. 11. of me memory device 22 with respect to the instructions, is 

FIGS. 12A-12D are flowcharts representative of the maintained the same as the ROM 20 but at a different base 

MemMap 1750A function block illustrated in FIG. 7. address B. In other words, instructions in the ROM 20 are 

FIG. 13A is a block diagram for BlkProt 1750A function located at a base address A plus an instruction counter (IC) 

block illustrated in FIG. 7. 15 onset which corresponds to the next legacy instruction to be 

FIG. 13B is a block diagram for BlkProt Update function executed. The instructions, INSTR1, INSTR 2, INSTR3, etc. 

in accordance with the present invention, are mapped to the ROM 22 at a different base address B but 

FIG. 13C is a block diagram for a Blk Write function in with ^ samc IC ofiEset ' 

accordance with the present invention. _ n The direct vectors in the ROM 22 can either be JUMP 

CTP . .1 -I i • p Tin d a Ft instructions to software routines for emulating the legacy 

FIG. 13D is a block diagram of a Blk Read function in . „ 6 . ,? \ 

, « • instruction or address pointers. For example, the direct 

accordance with the present invention. t i ,- 

y vectors can represent an offset pointer to an emulation 

FIG. 14A is a block diagram of an ASchange Update microcode rou tine or a pointer to a table which contains a 

function in accordance with the present invention. pointef tQ a microcode routinei Regardless, it is these vectors 

FIG. 14B is a block diagram for a PSchange Update 25 mat are fetched by the emulation software rather than the 

function in accordance with the present invention. legacy instructions. 

FIG. ISA is a block diagram of a SW 1750A function xhe immediate data may be translated into bogus vectors 

block illustrated in FIG. 7. in the ROM 22 which are not used by the emulation 

FIG. 15B is a block diagram of an Update SW function in 3Q software. Rather, the emulation software in accordance with 

accordance with the present invention. present invention may access the immediate data directly 

from the legacy code 20 by reading the data directly at the 

DETAILED DESCRIPTION OF THE 5ase address A from ^ ROM 20. 

INVENTION Various methods are suitable for translating the legacy 
The present invention relates to a emulation system which 35 microcode. Both hardware and software methods for trans- 
meets MIL-STD-1750A. FIGS, 1-4 relate to a software lating these instructions are suitable. FIG. 2 illustrates a 
emulation system and method for emulating legacy instnic- hardware implementation in which a hardware device 24, 
tions of an outdated ("legacy") microprocessor with a new such as a ROM containing a lookup table, is coupled to the 
upgrade microprocessor with an incompatible instruction set data bus between an upgrade microprocessor 26 and the 
FIGS. 5-9 relate to a method for implementing the paging 40 legacy microcode, i.e. ROM 20. The hardware device 24 is 
and protection attributes, such as block protection and configured such that at any time an access to the legacy code 
access lock and key functions features specified in MIL- is requested, i.e. base address B plus IC offset, the vector 
STD-1750A. corresponding to the requested instruction is provided. 

Alternately, the decoder can be bypassed such that an access 

DIRECT VECTORED LEGACY INSTRUCTION ^ t0 the legacy code (i e< base address A phls IC offeet) will 

SET EMULATION return the untranslated data. Thus, the upgrade processor can 

In applications in which a new upgrade "native" micro- be directed to the associated emulation code routine by the 

processor replaces an outdated "legacy microprocessor with fetched vector, or it can access immediate data directly from 

a different instruction set, software programs are written memory. 

which emulate each of the legacy instructions using instruc- 50 In an alternate embodiment of the invention, the legacy 

tions of the upgrade microprocessor. In known emulation emulation code may be translated by software when the 

systems, the emulation software causes the legacy instruc- legacy memory is loaded into main memory or modified. In 

tion to be fetched and interpreted by the new microprocessor particular, a software program, for example, a portion of the 

by way of the lookup table. As discussed in more detail initialization software, is used to load the legacy code, into 

below, such methodology has a significant impact on the 55 the computer system main memory 28 (FIG. 3) (e.g. at base 

throughput of the system. In order to increase the throughput address A). In this implementation, after loading the legacy 

of the system, the legacy instructions are translated into microcode, the direct vectors (i.e. ROM 22) are loaded into 

direct vectors to software routines or emulation code for the main memory 28 at another location (e.g. base address 

emulating the legacy instructions. As such, as will be B), allowing the emulation code in the upgrade processor to 

discussed in more detail below, the probability of cache 60 access either the translated 22 or untranslated 20 legacy 

misses is greatly reduced which results in increased through- memory sections. The vectors retrieved from the translated 

put of the system. memory 22 are used to point to the individual software 

Turning to FIG. 1, embedded software or code for a routines in the emulation code 30. With this embodiment of 

legacy microprocessor is generally identified with the ref- the invention, the translated 22 and untranslated 20 legacy 

erence numeral 20. Such code is normally stored in non- 65 memory sections need not be disjoint, they might be 

volatile read only memory (ROM). As shown, the ROM 20 interleaved, such that each vector immediately follows or 

includes legacy instructions, identified as INSTR 1, INSTR proceeds the untranslated instruction. 
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As mentioned above, the configuration of the method and LEGACY MIL-STD-175GA SOFTWARE 
system for emulating legacy instructions is adapted to 

improve the throughput of the system. More particularly, Emulator Address Translation 
many known microprocessors include cache memories in „. r . . 
order to improve the throughput of the system. Software s . Microprocessors and software used m airborne apphca- 
fetched from the main memory, is copied into the cache tl0ns are required to comply with MIL-STD-1750A. As 
memory, which is much quicker than main memory. Thus, discussed above, oftentimes outdated "legacy" microproces- 
instructions stored in the cache memory can be executed sors are replaced with newer "native" microprocessors, 
much quicker than those stored only in main memory. Such Since the native microprocessors are used with legacy 
cache memories are normally formed from high speed static software, software emulators have been developed to emu- 
random access memory (SRAM) and are used to store 1 late the instructions of the legacy microprocessor. These 
copies of data in the main memory or newly stored data. native microprocessors as well as the software emulators 
Such cache memories operate on the principles that most need to be in compliance with the aforementioned standard, 
programs execute instructions in sequence, and, due to One aspect of the aforementioned standard relates to 
loops, programs are likely to re -use recently fetched ins true- memory paging with associated protection attributes, such 
tions. This principle is call locality. Thus, instead of fetching 15 as block protection and access lock and key functions. In 
a single instruction at a time, a cache memory system looks particular, MIL-STD-1750A applies to 16 bit computer 
ahead and fetches blocks of instructions in sequence and systems and requ ires separate logical address space for 
stores the instructions for quick access. instructions and operands. In particular, MIL-STD-1750A 

In operation, all data stored in a cache memory is stored req uires sixteen (16) 64K word logical operand or data 

with what is known as an address tag. The address tag 20 spaces and sixteen (16) 64K word logical instruction spaces 

indicates the physical addresses of the data in the main {Q be d jnto t M word of h ical mem Each 64R 

memory that is being stored m the cache Whenever the wofd ^ controUed b sixteen (16) 

microprocessor initiates a memory access, the address tags * ^ ^ ' ^ J 

in the cache memory are first examined to determine if the . 6 , , f * j 

particular data requested is already stored in the cache 25 as the Protection attributes, such as execute protect and write 

memory. When the data is found in the cache memory, this P™ te *J£ *l e Lf m ac f ^ l oc * and ke y ; In f 6lh ™> ih * 

is known as a cache hit and data is immediately available to MIL-STO-1750A specifies block protection for physical 

the microprocessor. If it is determined that the requested data memor y m blocks of 1K wo / d sizes ^ dl f ussed above > 

is not in the cache memory, this condition is known as a mcctlD S ^ requirements of MIL-STO-1750A can cause 

cache miss. As a result of a cache miss, the requested data 30 senous P e rf° rmance degradation in software emulators, 

then must be retrieved from the main memory at a much The system in accordance with the present invention 

slower rate. utilizes the virtual paging architecture, available on micro - 

FIG. 4 illustrates a typical configuration of a micropro- processors constructed in accordance with the PowerPC 

cessor with onboard cache memory. In previous known architecture, as generally shown in FIG. 6, to implement the 

systems, the known emulation software fetched the legacy 35 P a S in S and P rotection attributes specified in M1L-STD- 

instructions themselves. The legacy instructions were then 1750A - Smce the P a S m S and protection attributes are imple- 

interpreted by way of lookup table. Since the cache memory mented in hardware, emulation of the paging with protection 

is based on the premise that a sequence of data will be attributes in accordance with MIL-STD-1750A operates 

requested in sequential memory locations, the use of the more efficiently than other known systems, 

lookup table is not particularly efficient in cache memory 40 As discussed above, the MIL-STD-1750A specifies that 

systems and results in a relatively high probability of cache the memory space be configured into sixteen (16) 64K word 

misses. By utilizing direct vectors, and because of effects of logical operand spaces (OPND 1750A) and (16) 64K word 

the locality principle in the legacy code and corresponding logical instruction spaces (INST 1750A). Referring to FIG. 

direct vectors, the probability of cache misses is greatly 5, only one of the 16 operand logical address spaces 36 and 

reduced thereby increasing the overall throughput of the 45 one instruction logical address spaces 38 is illustrated for 

system. More particularly, referring to FIG. 4, a memory convenience. Referring to the operand logical address space, 

system in accordance with the present invention is illus- each of the operand logical address spaces 36 is mapped by 

trated. As shown, the system includes the upgrade micro- sixteen (16) hardware logical page registers 40 into sixteen 

processor 26 which includes two onboard cache memories (16) emulator logical address space pages; each page gen- 

32 and 34. One cache memory 32 is used for data, forming 50 e rally identified with the reference numeral 43 and config- 

a data cache while the other cache memory 34 is used for ured as 4K word on 4K word boundaries. Thus, the sixteen 

instructions forming an instruction cache. The instruction (16) emulator operand logical address 64K word spaces 36 

cache 34 may be used almost exclusively for the emulation are mapped into 256 4 K word emulator pages 43 for a total 

microcode. The data cache 32 may be used for the legacy of 1 M word of address space. The mapping of the emulator 

code. 55 instruction logical address 38 is similar as illustrated in FIG. 

In operation, a group of eight vectors may be fetched from 5 and is mapped into the same 1 M word of address space, 
main memory upon a cache miss and stored in the data cache In accordance with an important aspect of the invention, 
32 as part of the cache line refill operation. Since legacy the paging requirements set forth in MIL-STD-1750A can 
instructions normally proceed in sequential order, the sub- easily be implemented by utilizing a microprocessor con- 
sequent 7 requests for instruction vectors will normally be 60 forming to the PowerPC architecture. In order to take 
resident in the data cache 30. If the next legacy instructions advantage of the PowerPC architecture with respect to 
to be executed is non -sequential but is within the last one paging as well as the protection attributes, the system in 
thousand instructions to executed, (i.e. local loops), there is accordance with the present invention represents each word 
a high probability that the vector will still be in the data in the emulator pages 43 as 4 bytes to the Power PC 
cache 30. This invention has reduced the probability of 65 environment. As such, this allows one (1) emulator operand 
cache misses and thus increased the throughput of the page 43 to be mapped to four (4) PowerPC pages 44,46,48 
system. and 50 of physical memory as shown in FIG. 5. Thus, each 



09/23/2003, EAST Version: 1.04.0000 



US 6,212,614 Bl 
7 8 

emulator operand 64K word logical address space 36 is MIL-STD-1750A is illustrated in FIG. 7. As shown in FIG. 

mapped to sixteen (16) emulator logical 4K word address 7, blocks which include the suffix -1750A are part of the 

pages 43, which, in turn, are mapped to four (4) PowerPC software emulator. 

pages; each power PC page being 4K bytes. The sixteen (16) Referring to FIG. 7, 1 M word of physical memory is 
emulator operand 64K word logical address spaces will thus 5 represented by a PhyMem 1750A function block, identified 
map to 1024 PowerPC 4K byte pages 36. As such, one (1) with the reference numeral 56. In particular, the PhysMem 
Power PC page will map to a IK word block of the emulator 1750A block 56 represents 1 M word of physical memory 
instruction logical address space 38. By so configuring the which may have additional attributes, for example; 
PowerPC microprocessor, one (1) PowerPC page (4K bytes) NOMEMORY indicating physical memory not imple- 
will map to a IK word block of the emulator operand logical mented; ROM indicating physical memory as read only; 
address space. Since the PowerPC architecture provides for ram indicating physical memory as read write and NON- 
block protection of each PowerPC page, which, in turn, is CACHEABLE indicating that the physical memory is off- 
mapped to IK word of emulator operand logical address board and should not be cached by the PowerPC micropro- 
space 36, block protection down to a granularity of IK word cessor. The PhysMem 1750A function block 56 manages the 
as required by MIL-STD-1750 is satisfied. i5 physical memory in IK word blocks. 

Implementation of the virtual paging scheme by way of a a diagram of the memory attribute table 57 contained in 

microprocessor configured to the PowerPC architecture can tn e PhysMem 1750A function block 56 is illustrated in FIG. 

be done rather conveniently and easily. Detailed information io. The memory attribute table 57 is divided into 1024 

for programming a microprocessor conforming to the Pow- entries, such as the entries 70, 72, 74 and 76. Each entry 70, 

erPC architecture is set forth in "PowerPC Microprocessor 2Q 72, 74, and 76 consists of a PowerPC address field, generally 

Family: The Programmers Reference Guide, document No. identified with the reference numeral 78, and a protection 

MPCPRG/D, dated October, 1995, available from Motorola, attribute field 80. As discussed above, 4K bytes in the 

hereby incorporated by reference. PowerPC address field 78 are mapped into IK words in 

The virtual paging architecture of the PowerPC micro- order to provide protection attributes down to a IK word 

processor is best understood with reference to FIG. 6 and as 25 block level. Thus, each entry in the PowerPC address field 

generally described in "Computer Architecture", by Robert corresponds to 4K bytes. As discussed above, by mapping 

J. Baron and Lee Higbie, published by Addison-Wesley each word in the emulator environment to 4 bytes in the 

Publishing Company, copyright 1992, pages 206-208, PowerPC environment, protection can be provided down to 

hereby incorporated by reference. In general, logical the IK word level in order to comply with MIL-STD-1750A 

addresses are divided into two parts, a virtual page number 30 As discussed above, the logical address space is divided 

and a word offset within a page. The division is done by up mto (16) operand logical address spaces 36 and sixteen 

partitioning the bits of the logical address, in this case, the (16) instruction logical address spaces 38. These operand 

logical addresses of the emulator operand and instruction logical address spaces 36 and the instruction logical address 

pages 36 and 38. Normally, the higher order bits are used for spaces 33 are eacn formed from sixteen (16) 64K word 

the page number while the lower order bits represent the 35 logical address spaces to provide 1 M word logical address 

offset. In order to account for the translation between logical space f or use f or operands and instructions. The sixteen (16) 

and physical address locations within the memory, the logical operand spaces 36 and the sixteen (16) instruction 

operating system of the microprocessor normally utilizes a spaces 38 are presented as windows in the PowerPC 

page table 52 in main memory. Such a page table 52 memory space. All accesses to the operand 36 and instruc- 

normally includes control information for each memory 40 tion spaces 38 by the software emulator are by way of micro 

page. An exemplary page table 52 is illustrated in FIG. 6 and coc je routines, identified with the reference numerals 58 and 

includes a column for a validity bit (V), which indicates 60 respectively. As discussed above, each of the sixteen (16) 

whether the page is in memory; a dirty bit (D) which 64K word operand logical address spaces 36 and instruction 

indicates whether the program has modified the page; and a logical address spaces 38 are mapped by logical page 

protection field which indicates which users may access the 45 registers 40 and 42, discussed above; each hardware register 

page and how (for example read only or read and write) and 40, 42 mapping a 64K word address space to sixteen (16) 4K 

a page frame number for the page. worc j pages 43 on 4K word boundaries as generally shown 

Each row in the page table 52 represents a 4K byte page in FIG. 5. 

of memory in the PowerPC architecture environment. As a bit map of the word format for the logical page registers 

discussed above, since each page in the PowerPC architec- 50 40, 42 in accordance with MIL-STD-1750 is illustrated in 

ture environment is used to represent IK word of legacy F IG. 8. As shown, the field AL (bits 0-3) relates to an 

memory, block protection in IK word blocks is easily optional access lock and key feature. If the access lock and 

accomplished, since the PowerPC architecture provides for k ey feature is not used, bits 0-3 are reset to zero. If the 

block protection for each PowerPC page. As such, the block access ] oc k key feature is used, bits 0-3 define the 

protection at the IK word level of granularity for the 5S access lock code for the associated page register, used with 

emulator logical address space as required by the MIL-STD- the access key codes, discussed below, to determine access 

1750A is greatly simplified and implemented simply by the permission. Permissible values of the access lock codes and 

block protection provided by the PowerPC microprocessor acccss k cy coc j es arc prov ided in Table VII of the MIL- 

architecturc. STD-1750A. Bit 4 is a dual functional bit; the function of 

As will be discussed below, the PowerPC architecture also so which is determined by whether it is used with an instruction 

includes a translation look aside buffer (TLB) 54 (FIG. 6). page register or an operand page register. For instruction 

The TLB 54 is an address translation cache which essentially page registers, bit 4 is defined as an E bit and determines the 

has the same information as the page table 52 as generally criteria for reading for instruction fetches. When E=l, any 

shown in FIG. 6 to reduce accesses to the main memory and attempted instruction read designating the page register is 

speed up the overall throughput of the system. 65 denied access and a fault is generated. For operand page 

An exemplary block diagram for implementing the virtual registers, bit 4 is defined as a W bit and used to determine 

paging and block protection requirements specified by the criteria for permitting write references to an operand 
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page register. When W=l, attempted writes to the associated 
page register are not permitted and a fault is generated. Bits 
5-7 are reserved and are all set to 0. The PPA filed (bits 
8-15) is used to define the physical page address. 

The Power PC architecture includes a page table, such as 
the page table 52 illustrated in FIG. 6, for mapping the 
logical address space to physical address space. More 
specifically, an instruction page table 60 is provided to map 
the emulator instruction logical address space 38 to physical 
memory. Similarly, a Power PC operand page table 61 is 
used for mapping the operand logical address space 36 to 
physical memory. The logical addresses in the operand 
logical address space 36 and the instruction logical address 
space 38 are partitioned as shown in FIG. 6 with the higher 
order bits in the logical address space corresponding to page 
frame numbers in the page table 52 and the lower order bits 
representing the byte offset for the address of the operand or 
instruction within that particular memory page 43. Such 
page tables are normally maintained in main memory. In 
order to speed up memory accesses, the PowerPC architec- 
ture also keeps an internal cache of the most recendy used 
page table entries, called a translation look aside buffer 
(TLB). In particular, whenever the PowerPC microprocessor 
performs a memory access, it compares the logical address 
of the access against entries in the TLB 54 and if it finds a 
match it uses the physical address in the TLB 54 to perform 
the access. If the required logical address is not in the TLB 
54, the PowerPC operating system searches the page table 
52 looking for a match. If a match is found it is loaded into 
the TLB 54 and the access is completed. If the required entry 
is not found in a page table 52, a page fault is then generated. 

Referring to FIG. 7, all updates to the software emulator 
page registers 40 and 42 are through specialized input/output 
instructions (XIO's) that map one XIO location to one page 
register. These XIO's are intercepted by a PageReg 1750A 
function block 62 and passes them to a MemMap 1750A 
function block 64. 

A flow chart for the PageReg 1750A function block 62 is 
illustrated in FIG. 11. As discussed above, the PageReg 
1750A function block 62 intercepts the XIO's in step 82 and 
determines in step 84 whether the XIO's are for an operand 
or instruction page register by examining the port address of 
the XIO. As discussed above, the MIL-STD 1750A requires 
separate logical operand and logical instruction spaces. 
Once the system determines whether the XIO is for an 
operand or instruction page register, the system then deter- 
mines in step 86 or 88 whether the XIO is a read or write. 
After determining whether the XIO is an operand or an 
instruction page register access and whether it is a read or 
write access of that page register, the system calls the 
MemMap 1750A function block 64, as indicated in steps 90, 
92, 94 and 96. As will be discussed in more detail below, the 
MemMap 1750A function block 64 merely returns the page 
register value in steps 98 and 100 for read operations. For 
write operations, as will be discussed in more detail below, 
the PowerPC page tables are manipulated and the protection 
attributes are calculated and returned in steps 98 and 100. 

Referring to FIG. 7, when a data access is performed by 
the software emulator microcode (Ucode) routine 58 or 60 
to the operand or instruction logical address space 36 or 38, 
the PowerPC TLB 54 (FIG. 6) is searched to determine if it 
contains an entry with the virtual page number of the 
PowerPC effective address of the operand or instruction data 
access 36 or 38. If the TLB 54 does contain an entry with the 
virtual page number of the access, the PowerPC effective 
address is automatically translated to a corresponding Pow- 
erPC physical address according to FIG. 6 and the protection 
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attributes of that entry are checked. If no entry in the TLB 
54 contains the virtual page number of the access, the 
MemMap 1750A function block 64 updates the PowerPC 
page tables with the required synthesized page table entries 

5 and restarts the PowerPC instruction. If the protection 
attribute check indicates that the access being performed 
fails to meet the page table entry protection requirements, a 
PowerPC protection fault is generated. The MemMap 
1750A function block 64 intercepts the PowerPC protection 

0 faults resulting from the attempted PowerPC access that did 
not meet the page table entry protection requirements. If the 
logical address of the fault is associated with an operand or 
instruction in the logical address spaces 36 or 38, an 
exception is generated and passed back to the software 

15 emulator. 

A flow chart for the MemMap 1750A function block 64 is 
illustrated in FIGS. 12A-12D. As mentioned above, the 
PageReg 1750A function block 64 determines whether the 
XIO is for an instruction or an operand page register and 

20 whether the XIO is a write or read operation. For instruction 
and operand read operations, as illustrated in FIGS. 12Aand 
12B, the page register value is returned as indicated in steps 
102 and 104. As discussed above, the PowerPC tables 
include the protection attributes for each entry in the 

2j memory attribute table 57. As such, the protection attributes 
are returned in steps 102 and 104. 

For instruction and operand page register write 
operations, the PowerPC page tables are manipulated and 
the protection attributes are calculated as indicated in FIGS. 

30 12C and 12D. In particular referring to FIGS. 12C and 12D, 
for both instruction and operand page register write 
operations, the protection attributes are calculated in steps 
106 and 108, depending on the value in the AL field (FIG. 
8) of the logical page register 40 or 42. As discussed above, 

35 the AL field relates to an optional access lock and key 
feature. If the access lock and key feature is not used, this 
field is reset to zero. Subsequently, in steps 110 and 112 the 
EAV field from the page register 40 or 42 is read and the 
protection is calculated based in the value of the E/W field. 

40 As discussed above, the instruction page registers define an 
E bit which determines the criteria for reading instruction 
fetches. For operand page registers, this bit is defined as a W 
bit and is used to determine the criteria for permitting writes 
to an operand page register. Subsequently, in steps 114, 116, 

45 118 and 120 after the protection is determined based on the 
AL field and EAV field, the PowerPC page table entries are 
updated. In addition, the TLB 54 entries are flushed. 

Block protection XIO's are intercepted by a BlkProt 
1750A function block 66 and passed to the MemMap 1750A 

50 function block 64, All updates to the software emulator 
block protects occur through the specialized XIO's that map 
one XIO location to sixteen (16) block protect entries (1 bit 
per block). These block protection XIO's are passed to the 
Power PC page table 52 (FIG. 6) to specify write protection 

55 for each physical block. As discussed above, the Power PC 
architecture provides for write protection for each Power PC 
page. Since one Power PC 4K byte page corresponds to IK 
word of logical address space, block protection is provided 
down to IK word granularity level as required by MIL- 

6 o STD-1750A. 

A flow chart for the BlkProt 1750A function block 66 is 
illustrated in FIGS. 13A-13D. Referring to FIG. 13A, 
initially the system determines whether the XIO is a read 
operation or a write operation or an enable function for the 

65 enabling the block protect feature in step 122. In steps 124, 
125 and 130, the system calls the MemMap 1750A function 
block 64. In particular, for read operations the system calls 
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MemMap 1750A and executes a BlkRead operation in step 
126 as illustrated in FIG. 13D. As shown in FIG. 13D, the 
result of a BlkRead operation is a return of the block 
protection value in step 127, which is subsequently returned 
in step 128 (FIG. 13A) to complete the XIO read operation. 
For write operations, the system MemMap 1750A executes 
a BlkWrile operation in step 130. As shown in FIG. 13C, the 
BlkWrite operation makes a call to a BlkProt Update func- 
tion as indicated in FIG. 13B in step 132. The BlkProt 
Update function calculates the value of the protection 
attribute based on the new value in step 134 for each page 
register entry that maps into the particular block. In step 136, 
the PowerPC pages are updated. In step 138 the PowerPC 
affected TLB entries are flushed. 

The selection of the currently active set of sixteen (16) 
instruction and sixteen (16) operand page registers 42 and 40 
is controlled by the value of the AS field of the 1750A status 
word (SW) emulated in the GPR1750A function block 68, as 
set forth in MIL-STD-1750A. All explicit program reads and 
writes of the SW are made through special XIOs which call 
the SW1750A function block 67. 

A bit map for the status word register is illustrated in FIG. 
9 as specified in MIL-STD-1750A. The status word register 
is a 16 bit register and includes a condition status (CS) field 
(bits 0-3), defined in the MIL-STD-1750A. Bits 4-7 of the 
status word register are reserved. Bits 8-11 relate to a 
processor slate (PS) field. This field defines the access key 
for use with the access lock and key protection attribute. 
This PS field defines the memory access key code for all 
instruction and operand references to memory. Hie PS field 
also determines the criteria for execution of privileged 
instructions. When PS-0, privileged instructions are allowed 
to be executed. When PS*0, privileged instructions are 
aborted and a fault is generated. Bits 12-15 relate to an 
address state (AS) field. The AS field essentially defines the 
particular one of 16 logical address spaces 36 or 38 that is 
requesting access. The access key codes are read by the 
Power PC and used to calculate the protection for the 
protection field in the page table 52 (FIG. 6). 

A block diagram for the SW 1750A function block 67 is 
illustrated in FIG. 15A. As shown, the system receives 
XIO's and determines in step 152 whether the XIO is a write 
or a read operation. For write operations, the system calls the 
function blocks GPR 1750A in step 154 and executes an 
update SW instruction as discussed below. For read opera- 
tions the system also calls the function block GPR 1750Ain 
step 156 and executes a Get SW function and returns a status 
word value in step 158. The GetSW function simply returns 
the current status word. 

FIG, 15B is a simplified flow chart for an Update SW 
instruction. In response to a request to write a new status 
word, the new status word is saved in step 160. In step 162, 
the system determines whether the address state AS field has 
changed in the page registers 40, 42. If not, the system 
proceeds to step 164 and determines whether the processor 
state PS field has changed. If not, the system returns in step 
166. If the PS field has changed, the system calls the 
function block MemMap 1750A in step 168 and executes a 
call to the PS change update, illustrated in FIG. 14B, in step 
168. If the AS field has changed as determined in step 162, 
the system makes a call to the function block MemMap 
1750 A in step 170 and executes an AS change update as 
indicated in FIG. 14A. 

Flow charts for updating the AS and PS fields are illus- 
trated in FIG. 14Aand FIG. 14B. Updates to the AS field are 
indicated with the reference numeral 140. The changes to the 
AS field can be used for any number of customized functions 
as indicated by block 142. 
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Changes to the PS field are indicated in FIG. 14B. As 
shown, for each instruction and operation page, as indicated 
by the function block 144, the system calculates the protec- 
tion based on the new value for the PS field in step 146. In 
5 step 148 the PowerPC page register tables are updated in 
step 148 with the new values for the protection attributes. 
Subsequently, the system flushes the PowerPC TLB entries 
in step 150 and returns to step 144 for processing another 
instruction or operand page register. 

10 Power PC interrupts are handled by a Power PC Interrupt 
Service Routine 70. These interrupts are passed to the 
software emulator and in particular an Interrupts 1750A 
object 71. Interrupt handling does not form a part of the 
present invention, 

15 Obviously, many modifications and variations of the 
present invention are possible in light of the above teach- 
ings. Thus, it is to be understood that, within the scope of the 
appended claims, the invention may be practiced otherwise 
than as specifically described above. 

20 

What is claimed and desired to be secured by Letters 
Patent of the United States is: 

1. A memory system comprising: 

one or more logical page registers, at least one of said 
25 logical page register configured to map 64K bytes of 
logical address space to sixteen (16) 4K word logical 
pages on 4K word boundaries; and 
means for mapping each of said 4K word logical pages to 
four (4) 4K byte physical address pages, wherein each 
30 word in logical address space is mapped to 4 bytes in 
said physical address space; wherein said mapping 
means implemented in hardware. 

2. The memory system as recited in claim 1, further 
including means for providing one or more protection 

35 attributes for said memory. 

3. The memory system as recited in claim 2, wherein said 
protection attribute is block protection of IK word blocks of 
physical memory space. 

4. The memory system as recited in claim 2, wherein said 
40 protection attribute is access lock and key. 

5. The memory system as recited in claim 1, wherein said 
mapping means includes a microprocessor conforming to 
the Power PC architecture. 

6. A method for mapping logical address space to physical 
45 memory space, the method comprising the steps of: 

(a) providing one or more logical page registers config- 
ured to map 64K words of logical address space to 
sixteen (16) 4K word logical pages on 4K word logical 
pages on 4K word boundaries; and 
50 (b) mapping each of said 4K word logical pages to four 
(4) 4K byte physical address pages, wherein each word 
in logical address space is mapped to 4 bytes in said 
address pages. 

7. The method as recited in claim 6, further including the 
55 step of providing one or more protection attributes for said 

memory. 

8. The method as recited in claim 7, wherein said pro- 
tection attribute is block protection of IK word blocks in 
logical memory space. 

6Q 9. The method as recited in claim 7, wherein said pro- 
tection attribute is access lock and key. 

10. The method as recited in claim 6, wherein said 
mapping is done in hardware. 

11. The method as recited in claim 10, wherein said 
^ hardware in a Power PC microprocessor. 

***** 
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PROCESSOR RESET GENERATED VIA 
MEMORY ACCESS INTERRUPT 

RELATED APPLICATIONS 

This patent application is related to the following patent 
applications: U.S. patent application Sen No. RO999-021, 
entitled "Apparatus and Method for Specifying Maximum 
Interactive Performance in a Logical Partition of a Computer 
System Independently from the Maximum Interactive Per- 
formance in Other Partitions " Ser. No. 09/314,541 filed 
May 19, 1999 by Armstrong et al.; U.S. patent application 
Ser. No. RO999-023, entitled "Management of a Concurrent 
Use License in a Logically-Partitioned Computer," Ser. No. 
09/314,324 filed May 19, 1999 by Armstrong et al.; U.S. 
patent application Ser. No. RO999-024, entitled "Event- 
Driven Communications Interface for Logically -Partitioned 
Computer," Ser. No. 09/4314,187 filed May 19, 1999 by 
Armstrong et al.; and U.S. patent application Ser. No. 
RO999-025, entitled "Logical Partition Manager and 
Method," Ser. No. 09/314,214 filed May 19, 1999 by Arm- 
strong et al. 

FIELD OF THE INVENTION 

The invention is generally related to computers and 
computer software. In particular, the invention is generally 
related to initiating a reset of a computer processor via a 
software-based mechanism. 

BACKGROUND OF THE INVENTION 

Computer technology continues to advance at a rapid 
pace, with significant developments being made in both 
software and in the underlying hardware upon which such 
software executes. One significant advance in computer 
technology is the development of multi-processor 
computers, where multiple computer processors are inter- 
faced with one another to permit multiple operations to be 
performed concurrently, thus improving the overall perfor- 
mance of such computers. Also, a number of multi -processor 
computer designs rely on logical partitioning to allocate 
computer resources to further enhance the performance of 
multiple concurrent tasks. 

With logical partitioning, a single physical computer is 
permitted to operate essentially like multiple and indepen- 
dent "virtual" computers (referred to as logical partitions), 
with the various resources in the physical computer (e.g., 
processors, memory, input/output devices) allocated among 
the various logical partitions. Each logical partition executes 
a separate operating system, and from the perspective of 
users and of the software executing on the logical partition, 
operates as a fully independent computer. 

A shared resource, often referred to as a "hypervisor" or 
partition manager, manages the logical partitions and facili- 
tates the allocation of resources to different logical parti- 
tions. As a component of this function, a partition manager 
maintains separate virtual memory address spaces for the 
various logical partitions so that the memory utilized by 
each logical partition is fully independent of the other 
logical partitions. One or more address translation tables are 
typically used by a partition manager to map addresses from 
each virtual address space to different addresses in the 
physical, or real, address space of the computer. Then, 
whenever a logical partition attempts to access a particular 
virtual address, the partition manager translates the virtual 
address to a real address so that the shared memory can be 
accessed directly by the logical partition. 
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A primary benefit of multi -processor computers, and in 
particular of those implementing partitioned environments, 
is the ability to maintain at least partial operational capa- 
bility in response to partial system failures. For example, 

s while most computers, and in particular most multi- 
processor computers, are relatively reliable, the processors 
in such computers can "hang" from time to time and cease 
to operate in responsive and predictable manners, e.g., due 
to software design flaws, or "bugs", that cause such proces- 

10 sors to operate continuously in endless loops. In a parti- 
tioned environment in particular, hanging a processor allo- 
cated to a particular logical partition often results in that 
partition becoming at least partially inoperative and non- 
responsive. However, other logical partitions that do not rely 

15 on the hung processor are typically not affected by the 
failure. 

While it may be acceptable in some situations to permit a 
computer to simply be powered off and on to recover from 
a hung processor, in many situations it is more desirable to 

20 provide the ability for a hung processor to be reset, or 
restored to a known state, in such a manner that the entire 
computer does not need to be shut down. Also, in a multi- 
processor computer, and in particular one that implements a 
partitioned environment, it is often desirable for such a reset 

25 operation to not affect other processors and/or other logical 
partitions operating in the computer so that the other pro- 
cessors and/or logical partitions can still perform useful 
operations while the hung processor is reset. 

In many multi-processor computers, and in particular in 

30 those implementing partitioned environments, a software- 
based reset mechanism is typically supported to permit one 
processor to initiate a reset of another processor. Typically, 
a software-based reset mechanism relies on the use of 
interrupts, often referred to as inter-processor interrupts 

35 (IPI's), to cause a hung processor to reset and restore itself 
to a known state. An IPI, like all interrupts, causes a 
processor to cease all current operations and immediately 
jump to dedicated program code, referred to as an "interrupt 
handler", to handle the interrupt. 

An IPI is typically handled as an "external" interrupt 
insofar as an IPI is initiated externally from the processor 
that receives the interrupt. Most processors, however, sup- 
port the ability to selectively enable or disable external 

45 interrupts so that such interrupts will be ignored — typically 
when a processor is executing relatively critical program 
code that should not be terminated prior to completion. The 
ability to disable external interrupts, however, introduces the 
possibility that a processor may hang while external inter- 

5Q rupts are disabled, and thus be incapable of being reset 
through an IPI. Should this occur, the only manner of 
resetting the processor would likely be a hardware reset, 
which would typically necessitate a full restart of the 
computer, and a consequent temporary inaccessibility of the 

55 computer. 

Therefore, a significant need exists for an alternate 
software -based reset mechanism for a processor that permits 
the processor to be reset in wider range of situations, and in 
particular, for a software -based reset mechanism for a pro- 
60 cessor that cannot be defeated as a result of the disabling of 
interrupts on the processor. 

SUMMARY OF THE INVENTION 

The invention addresses these and other problems asso- 
65 dated with the prior art by providing an apparatus, program 
product, and method that utilize a memory access interrupt 
to effect a reset of a processor in a multi-processor environ- 
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merit. Specifically, one processor (referred to herein as a FIG. 5 is a flowchart illustrating the program flow of a 

source processor) is permitted to initiate a reset of another partition manager interrupt handler executed by a target 

processor (referred to herein as a target processor) simply by processor in the computer of FIGS. 1 and 2, in response to 

generating both a reset request and a memory access inter- a memory access interrupt, 

rupt for the target processor. The target processor is then 5 

specifically configured to detect the presence of a pending DETAILED DESCRIPTION 
reset request during handing of the memory access interrupt, 

such that the target processor will perform a reset operation Hardware and Software Environment 
responsive to detection of such a request. 

Detection of a reset request is typically implemented 10 Turnin g t0 the Drawings, wherein like numbers denote 

within an interrupt handler that is executed by a target like parts throughout the several views, FIG. 1 illustrates a 

processor in response to a memory access interrupt. As a data processing apparatus or computer 10 consistent with the 

result, for those situations in which a memory access inter- invention. Apparatus 10 generically represents, for example, 

rupt is generated for a reason other than to initiate a reset of any of a number of multi-user computer systems such as a 

the target processor, the target processor can handle the network server, a midrange computer, a mainframe 

interrupt in an appropriate manner, and often with little 15 computer, etc. However, it should be appreciated that the 

additional overhead associated with determining whether a invention may be implemented in other data processing 

reset operation should be performed as a result of the apparatus, e.g., in stand-alone or single -user computer sys- 

interrupt. terns such as workstations, desktop computers, portable 

A memory access interrupt may be considered to include computers, and the like, or in other computing devices such 

any type of interrupt that is generated responsive to a 20 a s embedded controllers and the like. One suitable imple- 

memory access attempt by the target processor. Particularly mentation of apparatus 10 is in a midrange computer such as 

given the general necessity for a processor to always be the AS/400 series computer available from International 

capable of accessing memory, a memory access interrupt is Business Machines Corporation. 

often further characterized as being incapable of being Apparatus 10 generally includes a plurality of system 
disabled during the operation of the target processor. As a 25 processors n t0 a mem0 ry subsystem including 
consequence, unlike external interrupts such as IPFs and the main storage u e g ? an of dynamic ran dom access 
like which are capable of being disabled in some instances, memory (DRAM). System processors 12 may be configured, 
a reset operation can be initiated on a target processor via a for example> t0 implement a symmetric multiprocessing 
memory access interrupt irrespective of whether other inter- ^ SMp j environmentj among ot her possible multi-processor 
rupts are disabled on the processor. 30 environments. Also illustrated as interposed between pro- 
While other alternative memory access interrupt imple- cessors 12 and main storage 14 is a cache subsystem 16, 
mentations may also be utilized consistent with the typically including one or more levels of data, instruction 
invention, one particularly useful implementation relies on a and/or combination caches, with certain caches either serv- 
type of memory access interrupt that is generated in ing individual processors or multiple processors as is well 
response to an attempt by a target processor to access a 3 known in tne art Furthermore, main storage 14 is coupled to 
virtual memory address in a virtual memory address space a niun b e r of types of external (I/O) devices via a system bus 
that is not mapped by any entry in an address translation 18 and a p i ura ijty 0 f interface devices, e.g., an input/output 
table. Generation of a memory access interrupt then typi- bus attachment interface 20, a workstation controller 22 and 
cally requires only that one or more entries in the address a storage controller 24, which respectively provide external 
translation table be invalidated to ensure that a subsequent 40 access to one or more external networks 26, one or more 
access to the virtual memory address space will attempt to workstations 28, and/or one or more storage devices such as 
access an unmapped virtual memory address. a di rec t access storage device (DASD) 30. 

Therefore, consistent with one aspect of the invention, a In the illustrate d embodiment, computer 10 is imple- 

processor may be reset by generating a reset request for the ^ mMed ^ a logically part itioned computer. In this regard, 

processor, generating a memory access interrupt on the nG 2 illustrates in greater detail the primary software 

processor, and resetting the processor during handling of the compon ents and resources utilized in implementing a logi- 

memory access interrupt by the processor responsive to caUy partitioncd computing environment on computer 10, 

detection of the reset request. including a plurality of logical partitions 40, 42, 44 managed 

These and other advantages and features, which charac- 5o by a partition manager 46. Any number of logical partitions 

terize the invention, are set forth in the claims annexed may be suppor ted as is well known in the art. Moreover, it 

hereto and forming a further part hereof. However, for a win be appr eciated that the invention may be utilized in 

better understanding of the invention, and of the advantages omer partitioned environments, as well as in other comput- 

and objectives attained through its use, reference should be ers ( e g ^ non-partitioned computers) that utilize multiple 

made to the Drawings, and to the accompanying descriptive 5$ pr0 cessors. 

matter, in which there is described exemplary embodiments , n ^ mustra[ed ^mentation, logical partition 40 

of the invention. operates as a primary partition, while logical partitions 42 

BRIEF DESCRIPTION OF THE DRAWINGS ^ 44 opcr J as JJ nd&ry partitions< A * priraary partition 

FIG. 1 is a block diagram of a computer consistent with m ^ s C0Dlex t shares some of the partition management 

the invention. 60 functions for the computer, such as handling the powering 

FIG. 2 is a block diagram of the primary software on or powering off of the secondary logical partitions on 

components and resources in the computer of FIG. 1. computer 10, or initiating a memory dump of the secondary 

FIG. 3 is a block diagram of an address translation table logical partitions. As such, a portion of partition manager 46 

in FIG. 2. is illustrated by primary partition control block 50, disposed 

FIG. 4 is a flowchart illustrating the program flow of a 65 in the operating system 52 resident in primary partition 40. 

reset processor routine executed by a source processor in the Other partition management services, which are accessible 

computer of FIGS. 1 and 2. by all logical partitions, are represented by shared services 
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block 48. Implementation of partition management func- invention may be implemented in a number of manners, 

tionality within a primary logical partition is described, for including using various computer software applications, 

example, in U.S. patent application Ser. No. RO999-025, routines, components, programs, objects, modules, data 

entitled "Logical Partition Manager and Method, Ser. No. structures, etc., referred to hereinafter as "computer 

09/314,214, filed on May 19, 1999 by Armstrong et al. , 5 programs", or simply "programs". The computer programs 

which is incorporated by reference herein. However, parti- typically comprise one or more instructions that are resident 

tion management functionality need not be implemented al var i ous j n various memory and storage devices in 

within any particular logical partition in other implementa- me computcr> and that> when rcad and executed by one or 

Hons consistent with the invention. more processors in the computer, cause that computer to 

Each logical partition utilizes an operating system e.g 1Q perform thc stcps necessary t0 execute stcps or dements 

operating systems 52, 54 and 56 for logical partitions 40, 42 embodying lhe various of the inve ntion. Moreover, 

and 44, respectively), that controls the primary operations of ^ ^ invention faas aQd hereinafter win bc described in 

the logical partition in the same manner as the operating functioning computers, those skilled in 

system of a non-partitioned computer. For example, each - ^ M1 ■ * *l . *u • u a- p ,u 

operating system 52-56 may be implemented using the the art wdl a PP reci * e th ? t | Uie vanous embodiments of the 

OS/400 operating system available from International Busi- 15 mvention are capable of being distributed as a program 

ness Machines Corporation, among others, residing on top P rodu t ct in a ° f f onns > the invention applies 

of a kernel, e.g, AS/400 system licensed internal code equally regardless of the particular type of signal bearing 

(SLIC). The shared services in block 48 are alternatively medium used to actually carry out the distribution, 

referred to herein as partition licensed internal code (PLIC). Examples of signal bearing media include but are not limited 

Also illustrated are several user applications 58 that execute 20 to recordable type media such as volatile and non-volatile 

on each logical partition 40-44 and rely on the underlying memory devices, floppy and other removable disks, hard 

services provided by the operating systems thereof. disk drives, magnetic tape, optical disks (e.g, CD-ROM's, 

Each logical partition 40-44 executes in a separate DVD's, etc.), among others, and transmission type media 
memory space, represented by virtual memory 60. such as digital and analog communication links. 
Moreover, each logical partition 40-44 is statically and/or 25 [ n addition, various programs described hereinafter may 
dynamically allocated a portion of the available resources in 5e identified based upon the application for which they are 
computer 10. For example, each logical partition is allocated implemented in a specific embodiment of the invention, 
one or more processors 12, as well as a portion of the However> it should 5e appreciated that any particular pro- 
available memory space for use in virtual memory 60. nomenclature that follows is used merely for 
Logical .partitions can share specific hardware resources 30 conveniencej and lhus the invention should nol be limited t0 
such as processors, .such that .a given processor is utilized by ^ {n ^ lication idenlified and/or 
more than one logical partition. In the alternative hardware d guch nomenclature 

resources can be allocated to only one logical partition at a r \ _ ... 

t | me Those skilled in the art will recognize that the exemplary 

A j,.,. , « u i, ^ ~ environments illustrated in FIGS. 1 and 2 are not intended 

Additional resources, e.g, mass storage, backup storage, 3 j ( . _ , , tU , , . , u 

! * . « to limit the present invention. Indeed, those skilled in the art 
user input, network connections, and the like, are typically K . , iL u . \ , 
n /. / . , i • , ne ■ ' m will recognize that other alternative hardware and/or soft- 
allocated to one or more logical partitions in a manner well ? , , . . A , . r 
, *i_ , « u ii * a- i r ware environments may be used without departing from the 
known in the art. Resources can be allocated in a number of . J r t» 
manners, e.g, on a bus-by-bus basis, or on a resource -by- scope o e inven ion. 

resource basis, with multiple logical partitions sharing 40 Processor Reset Via Memory Access Interrupt 
resources on the same bus. Some resources may even be 

allocated to multiple logical partitions at a time. FIG. 2 The embodiments described hereinafter generally operate 

illustrates, for example, three logical buses 62, 64 and 66, by utilizing the existing memory access interrupt handling 

with a plurality of resources on bus 62, including a direct functionality of a processor to initiate a reset of the proces- 

access storage device (DASD) 68, a control panel 70, a tape 45 sor. A memory access interrupt can generally be considered 

drive 72 and an optical disk drive 74, allocated to primary to include any type of interrupt that is generated responsive 

logical partition 40. Bus 64, on the other hand, may have to a memory access attempt by a processor. For example, in 

resources allocated on a resource -by-resource basis, e.g, the implementation discussed hereinafter, a memory access 

with local area network (LAN) adaptor 76, optical disk drive interrupt is implemented as either or both of a data storage 

78 and DASD 80 allocated to secondary logical partition 42, 50 interrupt and instruction storage interrupt that is generated 

and LAN adaptors 82 and 84 allocated to secondary logical whenever a processor attempts to access a virtual memory 

partition 44. Bus 66 may represent, for example, a bus address in a virtual memory address space that is not mapped 

allocated specifically to logical partition 44, such that all to any real memory address by an address translation table 

resources on the bus, e.g, DASD's 86 and 88, are allocated utilized by that processor. However, it will be appreciated 

to the same logical partition. 55 that memory access interrupts may be generated in other 

It will be appreciated that the illustration of specific manners consistent with the invention, e.g, in response to a 

resources in FIG. 2 is merely exemplary in nature, and that miss of a translation lookaside buffer, among others, 

any combination and arrangement of resources may be A number of existing hardware and software components 

allocated to any logical partition in the alternative. in computer 10 are specifically utilized when initiating a 

Moreover, it will be appreciated that in some implementa- $o processor reset in the manner disclosed herein. For example, 

tions resources can be reallocated on a dynamic basis to as shown in FIG. 2, one or more address translation tables, 

service the needs of other logical partitions. Furthermore, it e.g., address translation tables 90, 92 and 94 (also referred 

will be appreciated that resources may also be represented in to as hardware page tables (HPT's)) are provided in partition 

terms of the input/output processors (IOP's) used to inter- manager 46 to respectively handle the virtual to real address 

face the computer with the specific hardware devices. 65 translation operations for logical partitions 40, 42 and 44, 

The various software components and resources illus- respectively. Moreover, as shown in FIG. 1, each processor 

trated in FIG. 2 and implementing the embodiments of the 12 optionally includes a translation lookaside buffer (TLB) 
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32 or other cache structure that caches at least a portion of partition 42 and used by target processor B (FIG. 2). As is 

one or more address translation tables to accelerate the well known in the art, an address translation table includes 

translation of virtual to real memory addresses, in a manner a plurality of entries, e.g., entry 102, including a plurality of 

well known in the art. fields 104, 106, 108 and 110. Address translation in the 

Each processor 12 typically also includes one or more s illustrated embodiment occurs on a page-by-page basis, e.g., 

levels of instruction cache, e.g., level one instruction cache with a page size of 4096 bytes. Each entry 102 thus matches 

(ICache) 34, within which one or more cache lines are a page of virtual memory address to a corresponding page of 

stored. Other relevant cache structures may also be found in rca j mem ory addresses in the memory system. The "page" of 

cache subsystem 16. Furthermore, as illustrated at 36, at a mem0 ry address is typically identified by those bits from 

least a portion of main storage is allocated for local storage 10 thc me address othcr ^ the lowest order number of 

for one or more processors, which local storage is statically bitg corresponding to the page size . for a page size of 

allocated to a fixed region of memory addresses to permit m(j ^ ^ * 64 . b / me address a is 

persistent access to the local storage at all times. ., , , iL a a-% if L*n- 

r ^ . . . . , , identified by the upper 42 bits (bits 0-41, where bit 0 is the 

To initiate a processor reset m the manner described MSB) wjth ^ , Qrder n bks ^ 4a _ e) utnized tQ 

herein, a first processor (referred to herein as a source „ .i , : A A 

' ,« , i • \ • - t . ( 4 f 15 specify a particular memory address in an identified page. As 

processor), which desires to initiate a processor reset of r , J . f, .„ t , , . 3 . t . c . , r r . 

, /' /^c a u„/L „~ „ such, in the illustrated implementation, field 104 of each 

another processor (referred to herein as a target . , , . , 

,,\ , •,,„„„, „ n „A „ entry 102 includes a 42-bit virtual page number, with entry 

processor), typically must generate a reset request and a +~J. , , . , , . . . , . 

memory access interrupt for the target processor. In FIG. 2, 106 includin S \ 4 ™>* number to which the virtual 

an exemplary source processor allocated to primary logical PJS* 15 ma PP ed " * should te appreciated that either or both 

partition 40 is illustrated at "A", and an exemplary target 20 ° f J he Vlrtual and real mem ° r y address spaces may have 

processor allocated to secondary logical partition 42 is differing sizes consistent with the invention, 

illustrated at "B". However, it should be appreciated that Each entry 1° 2 further includes a valid field 108 storing 

source and target processors may be allocated to the same a bit that indicates whether or not the entry represents a valid 

logical partition in some implementations. mapping of a virtual page to a real page. It is this bit that is 

In the illustrated embodiment, generation of a reset 25 cleared b V a source Processor whenever it is desired to 

request is implemented via setting a flag located at a static generate a memory access interrupt on a target processor that 

memory location in the local storage for the target processor. utilizes address translation table 92. 

The reset flag may alternatively be implemented in any other Additional information, represented by field 110, may 

memory storage device that is accessible (at least indirectly) also be stored within an entry 102 in an address translation 

to both the source and target processors. Moreover, other 30 table 92. Typically, such additional information includes 

manners of generating a reset request or otherwise indicating various protection bits, as well as reference, change, address 

to a target processor that a reset is requested will be compare and/or other information known in the art. It should 

appreciated by one of ordinary skill in the art having the be appreciated that other data structures may be utilized in 

benefit of the instant disclosure. an address translation scheme consistent with the invention. 

Also in the illustrated embodiment, generation of a FIG. 4 next illustrates a reset processor routine 120, 

memory access interrupt is implemented by invalidating, executed by a source processor whenever it is desirable to 

with the source processor, every entry in an address trans- reset a target processor to an initial state. Routine 120 may 

lation table associated with the target processor. Doing so be called, for example, whenever it is detected that a 

ensures that the next time the target processor attempts to 4Q processor, or a logical partition associated with that 

access any memory address (be it to retrieve a next instruc- processor, has become no n- responsive (here, after an unsuc- 

tion or to access data stored in memory), a memory access cessful IPI). Other situations in which it may be desirable to 

interrupt will be generated. In addition, to maintain reset a processor include power off, main store dump, and 

coherency, it is desirable to update any caching mechanisms continuously powered mainstore (CPM) initial program load 

(such as TLB's) to invalidate any cached entries from an 45 (IPL) of a secondary partition, among others, 

invalidated address translation table, and/or to update any First, as shown at block 122, the source processor may 

other caching mechanisms that cache data and/or instruc- attempt to send an inter-processor interrupt (IPI) to the target 

tions associated with any such invalidated entries. processor, in a manner known in the art. Next, the processor 

In other embodiments, it may not be necessary to invali- waits at block 124 for the target processor to return to a 

date every entry of an address translation table. For example, 50 known initial state, e.g., by setting a timer and periodically 

it may be possible in some embodiments to invalidate entries checking the responsiveness of the target processor. In such 

one at a time until a memory access interrupt is detected by an implementation, expiration of the timer without a 

the source processor. It may also be possible in some response from the target processor would indicate an unsuc- 

implementations to predict which entry will be accessed cessful reset operation. 

next, and only invalidate that entry. Furthermore, in other 5S Next, in block 126, it is determined whether the target 

embodiments different caching mechanisms may or may not processor has returned to its known initial slate. If so, routine 

need to be updated to maintain coherency. 120 is complete. If not, however, control passes to 128 to 

In addition, other mechanisms for generating a memory perform memory access-based processor reset consistent 

access interrupt may be utilized consistent with the inven- with the invention. 

lion. For example, as discussed above, a memory access 60 Specifically, in block 128, the source processor sets a reset 

interrupt may be initiated by generating a miss on a trans- fl ag i n the local storage for the target processor to be reset, 

lation lookaside buffer or other address translation data Next, in block 130, the source processor attempts to retrieve 

caching structure. the first valid entry in the address translation table allocated 

Other alternatives will be apparent to one of ordinary skill to the target processor. Assuming such an entry is found, 

in the art. 65 block 132 passes control to block 134 to invalidate the 

FIG. 3 illustrates in greater detail a suitable implementa- current entry, typically by clearing the valid bit therefor, 

tion of address translation table 92 allocated to logical Next, in block 136, any copy of the entry in the translation 
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lookaside buffer (TLB) for the target processor is invalidated 
by broadcasting a TLB invalidate entry message, the use and 
configuration of which is well understood in the art. 

Next, in block 138, all of the cache lines in the page that 
is being invalidated are invalidated in the target processor's 
instruction cache by broadcasting an instruction cache block 
invalidate (ICBI) instruction for each cache line referenced 
in the page. The use and operation of an ICBI instruction are 
well understood in the art. 

Blocks 136 and 138 essentially maintain coherency 
between the address translation table and any cached copies 
of any entries referenced thereby, as well as any cached 
copies of cache lines incorporated within any invalidated 
pages. It should be appreciated, however, that in other 
embodiments, coherency issues may not be present, and 
either or both of blocks 136 or 138 may be omitted. 

After broadcast of the ICBI instructions, control passes to 
block 140 to attempt to obtain the next valid entry in the 
address translation table associated with the target processor. 
Control then returns to block 132 to determine whether 
another such entry was found. Processing then continues 
until each valid entry in the address translation table has 
been invalidated. Once all such entries have been processed, 
block 132 passes control to block 142 to wait for the target 
processor to return to its known initial state. After this 
occurs, routine 120 is complete. 

Implementation of the functionality of routine 120 is 
typically within program code allocated to partition manager 
46 (FIG. 2), principally within shared services block 48. 
Specifically, block 50 in the primary logical partition portion 
of the partition manager determines whenever a processor 
needs to be reset, and initiates routine 120 in shared services 
block 48 to implement such a reset. Other allocations of 
functionality between blocks 48 and 50 may be used in the 
alternative, however. 

It should be appreciated that additional modifications may 
be made to routine 120 consistent with the invention. For 
example, rather than searching for only valid entries, all 
entries of an address translation table may be processed in 
the manner disclosed herein. Moreover, it may not be 
necessary or desirable in some embodiments to attempt an 
IPI prior to a memory access-based interrupt. Moreover, 
waiting for the target processor to return to its known initial 
state, as disclosed in connection with blocks 124 and 142 
may be performed in a number of alternate manners, includ- 
ing setting a watchdog timer, etc. Other modifications will 
be apparent to one of ordinary skill in the art. 

FIG. 5 next illustrates the initiation of a processor reset by 
the target processor in response to the generation of a 
memory access interrupt by the source processor (described 
above in connection with FIG. 4). As shown at 150, the 
target processor is illustrated as looping in operating system 
code with its interrupts disabled. In response to a data 
storage interrupt or instruction storage interrupt, control 
passes to a partition manager DSI/ISI interrupt handler 152 
executed by the target processor. Routine 152 is initiated any 
time a memory access interrupt is generated on the target 
processor, irrespective of whether the memory access inter- 
rupt was generated in response to a request to reset the 
processor. As such, routine 152 begins in block 154 by 
determining whether the reset flag allocated to the target 
processor in the local storage therefor is set. If so, control 
passes to block 156 to perform a processor reset and go to 
a known initial state in the partition manager code, in a 
manner well understood in the art. By returning to such a 
known initial state, the reset is achieved, in a manner 
specifically adapted for the particular configuration of com- 
puter 10. 

Returning to block 154, if the reset flag is not set, the 
memory access interrupt is handled in a conventional man- 
ner, e.g., by passing control to block 158 to call an interrupt 



i7,007 Bl 

10 

handler in the operating system code, as is also well known 
in the art. Typically, handling of such an interrupt includes 
determining whether the requested virtual address is valid, 
and if so, retrieving a real address from the operating 

s system's software page table. Then, the retrieved real 
address is stored in the address translation table in the 
partition manager, and normal processing is resumed. If the 
virtual address is not a valid address for the logical partition, 
an exception is created, which is handled in the operating 
system in a manner understood in the art. 

10 Various additional modifications may be made consistent 
with the invention. For example, other manners of insuring 
coherency between an address translation table and other 
components in the computer may be used in the alternative. 
Other modifications may be made to the illustrated 

15 embodiments without departing from the spirit and scope of 
the invention. Therefore, the invention lies in the claims 
hereinafter appended. 
What is claimed is: 

1. A method of resetting a processor, comprising: 
20 ( a ) generating a reset request for the processor; 

(b) generating a memory access interrupt on the proces- 
sor; and 

(c) resetting the processor during handling of the memory 
access interrupt by the processor responsive to detec- 

25 tion of the reset request. 

2. The method of claim 1, further comprising accessing 
data with the processor using a virtual memory system 
accessible through an address translation table associated 
with the processor, and wherein generating the memory 

30 access interrupt includes invalidating at least one entry in the 
address translation table. 

3. The method of claim 2, wherein invalidating at least 
one entry in the address translation table includes invalidat- 
ing every entry in the address translation table. 

4. The method of claim 3, wherein generating the memory 
35 access interrupt further includes invalidating a correspond- 
ing entry in a translation table cache associated with the 
processor. 

5. The method of claim 4, wherein generating the memory 
access interrupt further includes invalidating a cache entry, 
associated with a cache line referenced by an invalidated 
entry in the address translation table, and located in a cache 
associated with the processor. 

6. The method of claim 1, wherein the address translation 
table includes a plurality of entries, with each entry identi- 

45 fying a virtual memory address used by the processor and a 
real memory address mapped to the virtual memory address. 

7. The method of claim 1, wherein generating the reset 
request includes setting a reset flag. 

8. The method of claim 7, wherein the reset flag is stored 
50 in a local storage area for the processor. 

9. The method of claim 7, further comprising detecting the 
reset request in the processor by accessing the reset flag. 

10. The method of claim 1, wherein the memory access 
interrupt includes at least one of a data storage interrupt and 

55 an instruction storage interrupt. 

11. The method of claim 1, wherein resetting the proces- 
sor during handling of the memory access interrupt respon- 
sive to detection of the reset request is performed by a 
memory access interrupt routine executed by the processor 

60 responsive to a memory access interrupt. 

12. The method of claim 11, wherein generating the reset 
request and generating the memory access interrupt are 
performed on a second processor coupled to the first pro- 
cessor. 

65 13. The method of claim 12, wherein the first and second 
processors are among a plurality of processors in a multi- 
processor computer system. 
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14. The method of claim 13, wherein the multi-processor 
computer system defines a plurality of partitions, each 
partition including an operating system executing on at least 
one of the plurality of processors, the multi-processor com- 
puter system further including a partition manager executing 
on at least one of the plurality of processors, wherein 
generating the reset request and generating the memory 
access interrupt are performed by the partition manager. 

15. The method of claim 1, further comprising detecting 
whether the processor may be locked up with external 
interrupts disabled, wherein generating the reset request and 
generating the memory access interrupt are performed 
responsive to detecting that the processor may be locked up 
with external interrupts disabled. 

16. The method of claim 1, wherein generating the reset 
request and generating the memory access interrupt are 
performed responsive to determining that an attempt to reset 
the processor using an inter-processor interrupt was unsuc- 
cessful after determining that the processor is locked up. 

17. A method of resetting a first processor among a 
plurality of processors in a multi-processor computer 
system, comprising: 

(a) detecting with a processor other than the first proces- 
sor a likely lockup condition in the first processor while 
external interrupts on the first processor are disabled; 

(b) generating with a processor other than the first pro- 
cessor a reset request for the first processor; 

(c) generating with a processor other than the first pro- 
cessor a memory access interrupt on the first processor; 
and 

(d) handling the memory access interrupt in the first 
processor, including detecting the reset request with the 
first processor and performing a reset on the first 
processor responsive thereto. 

18. An apparatus comprising first and second processors, 
wherein: 

(a) the second processor is configured to initiate a reset of 
the first processor by generating a reset request and a 
memory access interrupt for the first processor; and 

(b) the first processor is configured to handle the memory 
access interrupt and to perform a reset responsive to 
detection of the reset request during handling of the 
memory access interrupt. 

19. The apparatus of claim 18, wherein the first processor 
is further configured to access data from a memory using an 
address translation table associated with the first processor, 
and wherein the second processor is configured to generate 
the memory access interrupt by invalidating at least one 
entry in the address translation table. 

20. The apparatus of claim 19, the second processor is 
configured to generate the memory access interrupt by 
invalidating every entry in the address translation table. 

21. The apparatus of claim 20, further comprising a 
translation table cache associated with the first processor, 
wherein the second processor is further configured to invali- 
date a corresponding entry in the translation table cache. 

22. The apparatus of claim 21, further comprising a cache 
associated with the first processor, wherein the second 
processor is further configured to invalidate a cache entry in 
the cache that is associated with a cache line referenced by 
an invalidated entry in the address translation table. 

23. The apparatus of claim 22, wherein the cache is a level 
one instruction cache. 

24. The apparatus of claim 18, wherein the second pro- 
cessor is configured to generate the reset request by setting 
a reset flag, and wherein the first processor is configured to 
detect the reset request by determining whether the reset flag 
is set. 
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25. The apparatus of claim 18, wherein the memory 
access interrupt includes at least one of a data storage 
interrupt and an instruction storage interrupt. 

26. The apparatus of claim 18, wherein the first processor 
s is configured to execute a memory access interrupt routine 

responsive to the memory access interrupt, and to detect the 
reset request during execution of the memory access inter- 
rupt routine. 

27. The apparatus of claim 18, wherein the first and 
second processors are among a plurality of processors in a 

10 multi-processor computer system, wherein the multi- 
processor computer system defines a plurality of partitions, 
each partition including an operating system executing on at 
least one of the plurality of processors, the multi-processor 
computer system further including a partition manager 
executing on at least the second processor. 

28. The apparatus of claim 18, wherein the second pro- 
cessor is configured to detect whether the first processor may 
be locked up with external interrupts disabled, and to 
generate the reset request and the memory access interrupt 
responsive to detecting that the first processor may be locked 

20 up with external interrupts disabled. 

29. The apparatus of claim 18, wherein the second pro- 
cessor is configured to generate the reset request and the 
memory access interrupt responsive to determining that an 
attempt to reset the first processor using an inter-processor 

25 interrupt was unsuccessful. 

30. An apparatus, comprising: 

(a) a memory defining a real address space with a plurality 
of real memory addresses; 

(b) a first processor configured to access the memory 
30 using a virtual address space including a plurality of 

virtual memory addresses; 

(c) an address translation table accessible by the first 
processor and including a plurality of entries, each 
entry configured to map a virtual memory address in 

35 the virtual address space to a real memory address in 
the real address space; 

(d) a memory access interrupt handler configured to be 
executed by the first processor in response to an attempt 
by the first processor to access an unmapped virtual 

40 memory address in the address translation table, the 
memory access interrupt handler further configured to 
reset the first processor responsive to a pending reset 
request; and 

(e) a second processor coupled to the first processor, the 
45 second processor configured to initiate a reset of the 

first processor by generating a reset request and invali- 
dating at least one entry in the address translation table. 

31. The apparatus of claim 30, wherein the plurality of 
processors are configured to implement a partitioned envi- 

50 ronment including a plurality of logical partitions. 

32. A program product, comprising: 

(a) first and second programs respectively configured to 
execute on first and second processors, the second 
program configured to initiate a reset of the first pro- 

55 cessor by generating a reset request and a memory 
access interrupt for the first processor, and the first 
program configured to handle the memory access inter- 
rupt and to perform a reset responsive to detection of 
the reset request during handling of the memory access 

60 interrupt; and 

(b) a signal bearing medium bearing the first and second 
programs. 

33. The program product of claim 32, wherein the signal 
bearing medium includes at least one of a recordable 

65 medium and a transmission-type medium. 

* + * * * 
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